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(2) INFORMATION FOR SEQ ID NO : 1 : 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1844 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

AAGTCAATAC CAACAACTAT TAGCAGTCAT ACGTGCAAGA ACAAGAAAGA AGAGATTCAA 60 

AAAGCTAAAT AAGAGAAATC AAAACAAAAG GTATAGAACA CCCGAACAAC AAAATCAAAA 12 0 

CATCCAATCC ATTTTAAACA AAAATTCCAA AAGAGACCGG CAACACAACA AGCACCAAAC 180 

ACAATGCCAA CTTTAATACT GCTAATTATT ACAACAATGA TTATGGCATC TTCCTGCCAA 240 

ATAGATATCA CAAAACTACA GCATGTAGGT GTATTGGTCA ACAGTCCCAA AGGGATGAAG 300 

ATATCACAAA ACTTCGAAAC AAGATATCTA ATTTTGAGCC TCATACCAAA AATAGAAGAC 3 60 

TCTAACTCTT GTGGTGACCA ACAGATCAAA CAATACAAGA GGTTATTGGA TAGACTGATC 42 0 

ATCCCTCTAT ATGATGGATT AAGATTACAG AAAGATGTGA TAGTAACCAA TCAAGAATCC 4 80 

AATGAAAACA CTGATCCCAG AACAAGACGA TCCTTTGGAG GGGTAATTGG AACCATTGCT 54 0 

CTGGGAGTAG CAACCTCAGC ACAAATTACA GCGGCAGTTG CTCTGGTTGA AGCCAAGCAG 600 

GCAAAATCAG ACATCGAAAA ACTCAAAGAA GCAATCAGGG ACACAAACAA AGCAGTGCAG 660 

TCAGTTCAGA GCTCTATAGG AAATTTAATA GTAGCAATTA AATCAGTCCA AGATTATGTC 72 0 

AACAACGAAA TGGTGCCATC GATTGCTAGA CTAGGTTGTG AAGCAGCAGG ACTTCAATTA 7 80 

GGAATTGCAT T AAC AC AG C A TTAC TCAGAA TTAACAAACA TATTTGGTGA TAACATAGGA 84 0 

TCGTTACAAG AAAAAGGAAT AAAATTACAA GGTATAGCAT CATTATACCG CACAAATATC 90 0 

ACAGAAATAT TCACAACATC AACAGTTGAT AAATATGATA TCTATGATCT ATTATTTACA 960 

GAATCAATAA AGGTGAGAGT TATAGATGTT GATTTGAATG ATTAC TCAAT CACCCTCCAA 102 0 

GTCAGACTCC CTTTATTAAC TAGGCTGCTG AACACTCAGA TCTACAAAGT AGATTCCATA 1080 

TCATATAATA TCCAAAACAG AGAATGGTAT ATCCCTCTTC CCAGCCATAT CATGACGAAA 114 0 

GGGGCATTTC TAGGTGGAGC AGATGTCAAG GAATGTATAG AAGCATTCAG CAGTTATATA 12 00 

TGCCCTTCTG ATCCAGGATT TGTACTAAAC CATGAAATGG AGAGCTGCTT ATCAGGAAAC 1260 

ATATC CCAAT GTCCAAGAAC CACGGTCACA TCAGACATTG TTCCAAGATA TGCATTTGTC 132 0 

AATGGAGGAG TGGTTGCAAA CTGTATAACA ACCACCTGTA CATGCAACGG AATCGACAAT 13 80 



AGAATCAATC AACCACCTGA TCAAGGAGTA AAAATTATAA CACATAAAGA ATGTAATACA 144 0 

ATAGGTATCA ACGGAATGCT GTTCAATACA AATAAAGAAG GAACTCTTGC ATTCTACACA 1500 

C C AAATG AT A TAACACTAAA TAATTCTGTT GCACTTGATC CAATTGACAT ATCAATCGAG 1560 

CTTAACAAAG CCAAATCAGA TCTAGAAGAA TCAAAAGAAT GGATAAGAAG GTCAAATCAA 162 0 

AAAC TAGATT CTATTGGAAA CTGGCATCAA TCTAGCACTA CAATCATAAT TATTTTAATA 1680 

ATGATCATTA TATTGTTTAT AATTAATGTA ACGATAATTA CAATTGCAAT TAAGTATTAC 174 0 

AGAATTCAAA AGAGAAATCG AGTGGATCAA AATGACAAGC CATATGTACT AACAAACAAA 18 00 

TGACATATCT ATAGATCATT AGATATTAAA ATTATAAAAA ACTT 1844 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 53 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met Pro Thr Leu 
1 

Ser Cys Gin lie 
20 

Asn Ser Pro Lys 
35 

Leu lie Leu Ser 
50 

Asp Gin Gin lie 
65 

Pro Leu Tyr Asp 



Gin Glu Ser Asn 
100 

Gly Val He Gly 
115 

Thr Ala Ala Val 
130 



He Leu Leu He 
5 

Asp He Thr Lys 



Gly Met Lys He 
40 

Leu He Pro Lys 
55 

Lys Gin Tyr Lys 
70 

Gly Leu Arg Leu 
85 

Glu Asn Thr Asp 



Thr He Ala Leu 
12 0 

Ala Leu Val Glu 
135 



He Thr Thr Met 
10 

Leu Gin His Val 
25 

Ser Gin Asn Phe 



He Glu Asp Ser 
60 

Arg Leu Leu Asp 
75 

Gin Lys Asp Val 
90 

Pro Arg Thr Arg 
105 

Gly Val Ala Thr 



Ala Lys Gin Ala 
140 



He Met Ala Ser 
15 

Gly Val Leu Val 
30 

Glu Thr Arg Tyr 
45 

Asn Ser Cys Gly 



Arg Leu He He 
80 

He Val Thr Asn 
95 

Arg Ser Phe Gly 
110 

Ser Ala Gin He 
125 

Lys Ser Asp He 



Glu Lys Leu Lys Glu Ala He Arg Asp Thr Asn Lys Ala Val Gin Ser 



145 



150 



155 



160 



Val Gin Ser Ser He Gly Asn Leu He Val Ala He Lys Ser Val Gin 
165 170 175 

Asp Tyr Val Asn Asn Glu He Val Pro Ser He Ala Arg Leu Gly Cys 
180 185 190 

Glu Ala Ala Gly Leu Gin Leu Gly He Ala Leu Thr Gin His Tyr Ser 
195 200 205 

Glu Leu Thr Asn He Phe Gly Asp Asn He Gly Ser Leu Gin Glu Lys 
210 215 220 

Gly He Lys Leu Gin Gly He Ala Ser Leu Tyr Arg Thr Asn He Thr 
225 230 235 240 

Glu He Phe Thr Thr Ser Thr Val Asp Lys Tyr Asp He Tyr Asp Leu 
245 250 255 

Leu Phe Thr Glu Ser He Lys Val Arg Val He Asp Val Asp Leu Asn 
260 265 270 

Asp Tyr Ser He Thr Leu Gin Val Arg Leu Pro Leu Leu Thr Arg Leu 
275 280 285 

Leu Asn Thr Gin He Tyr Lys Val Asp Ser He Ser Tyr Asn He Gin 
290 295 300 

Asn Arg Glu Trp Tyr He Pro Leu Pro Ser His He Met Thr Lys Gly 
305 310 315 320 

Ala Phe Leu Gly Gly Ala Asp Val Lys Glu Cys He Glu Ala Phe Ser 
325 330 335 

Ser Tyr He Cys Pro Ser Asp Pro Gly Phe Val Leu Asn His Glu Met 
340 345 350 

Glu Ser Cys Leu Ser Gly Asn He Ser Gin Cys Pro Arg Thr Thr Val 
355 360 365 

Thr Ser Asp lie Val Pro Arg Tyr Ala Phe Val Asn Gly Gly Val Val 
370 375 380 

Ala Asn Cys He Thr Thr Thr Cys Thr Cys Asn Gly He Asp Asn Arg 
385 390 395 400 

He Asn Gin Pro Pro Asp Gin Gly Val Lys He He Thr His Lys Glu 
405 410 415 

Cys Asn Thr lie Gly He Asn Gly Met Leu Phe Asn Thr Asn Lys Glu 
420 425 430 

Gly Thr Leu Ala Phe Tyr Thr Pro Asn Asp He Thr Leu Asn Asn Ser 
435 440 445 



Val Ala Leu Asp Pro He Asp He Ser He Glu Leu Asn Lys Ala Lys 



450 455 460 

Ser Asp Leu Glu Glu Ser Lys Glu Trp lie Arg Arg Ser Asn Gin Lys 
465 470 475 480 

Leu Asp Ser lie Gly Asn Trp His Gin Ser Ser Thr Thr lie lie lie 
485 490 495 

He Leu He Met He He He Leu Phe He He Asn Val Thr He He 
500 505 510 

Thr He Ala He Lys Tyr Tyr Arg He Gin Lys Arg Asn Arg Val Asp 
515 520 525 

Gin Asn Asp Lys Pro Tyr Val Leu Thr Asn Lys 
530 535 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 
□ (A) LENGTH: 1833 base pairs 

J! (B) TYPE: nucleic acid 

j; (C) STRANDEDNESS : single 

Cj (D ) TOPOLOGY : 1 inear 

yj 

%l (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

AG AC AAATC C AAATTCGAGA TGGAATACTG GAAGCATACC AATCACGGAA AGGATGCTGG 60 

CAATGAGCTG GAGACGTCCA TGGCTACTAA TGGCAACAAG CTCACCAATA AGATAACATA 12 0 

TATATTATGG ACAATAATCC TGGTGTTATT ATCAATAGTC TTCATCATAG TGCTAATTAA 180 

TTC CATC AAA AGTGAAAAGG CTCATGAATC ATTGCTGCAA GACATAAATA ATGAGTTTAT 24 0 

GGAAATTACA GAAAAGATCC AAATGGCATC GGATAATACC AATGATC TAA TACAGTCAGG 3 00 

AGTGAATACA AGGCTTCTTA CAATTCAGAG TCATGTCCAG AATTATATAC CAATATCACT 3 60 

GACACAACAG ATGTCAGATC TTAGGAAATT CATTAGTGAA ATTACAATTA GAAATGATAA 42 0 

TCAAGAAGTG CTGCCACAAA GAATAACACA TGATGTGGGT ATAAAACCTT TAAATCCAGA 4 80 

TGATTTTTGG AGATGCACGT CTGGTCTTCC ATCTTTAATG AAAACTCCAA AAATAAGGTT 54 0 

AATGCCAGGG CCGGGATTAT TAGCTATGCC AACGACTGTT GATGGCTGTA TCAGAACTCC 600 

GTCCTTAGTT ATAAATGATC TGATTTATGC TTATACCTCA AATCTAATTA CTCGAGGTTG 660 

T C AGGAT AT A GGAAAATCAT ATCAAGTCTT ACAGATAGGG ATAATAACTG TAAACTCAGA 72 0 

CTTGGTACCT GACTTAAATC CCAGGATCTC TCATACTTTT AACATAAATG ACAATAGGAA 780 

GTCATGTTCT CTAGCACTCC TAAATACAGA TGTATATCAA CTGTGTTCAA CTCCCAAAGT 84 0 



£ 

n 



TGATGAAAGA TCAGATTATG CATCATCAGG CATAGAAGAT ATTGTACTTG ATATTGTCAA 900 

TTATGATGGC TCAATCTCAA CAACAAGATT TAAGAATAAT AACATAAGCT TTGATCAACC 960 

TTATGCTGCA CTATACCCAT CTGTTGGACC AGGGATATAC TACAAAGGCA AAATAATATT 102 0 

TCTCGGGTAT GGAGGTCTTG AACATCCAAT AAATGAGAAT GTAATCTGCA ACACAACTGG 1080 

GTGTCCCGGG AAAACACAGA GAGACTGCAA TCAGGCATCT CATAGTCCAT GGTTTTCAGA 1140 

TAGGAGGATG GTCAACTCTA TCATTGTTGT TGACAAAGGC TTAAACTCAA TTCCAAAATT 12 00 

GAAGGTATGG ACGATATCTA TGAGACAGAA TTACTGGGGG TCAGAAGGAA GGTTACTTCT 12 60 

ACTAGGTAAC AAGATCTATA TATATACAAG ATCCACAAGT TGGCATAGCA AGTTACAATT 132 0 

AGGAATAATT GATATTACTG ATTACAGTGA TATAAGGATA AAATGGACAT GGCATAATGT 13 80 

GCTATCAAGA CCAGGAAACA ATGAATGTCC ATGGGGACAT TCATGTCCAG ATGGATGTAT 1440 

AACAGGAGTA TATACTGATG CATATCCACT CAATCCCACA GGGAGCATTG TGTCATCTGT 1500 

CATATTAGAT TCACAAAAAT CGAGAGTGAA CCCAGTCATA ACTTACTCAA CAGCAACCGA 1560 

AAGAGTAAAC GAGCTGGCCA TCCGAAACAG AACACTCTCA GCTGGATATA CAACAACAAG 162 0 

CTGCATCACA CACTATAACA AAGGATATTG TTTTCATATA GTAGAAATAA ATCAGAAAAG 168 0 

CTTAAACACA CTTCAACCCA TGTTGTTCAA GACAGAGGTT CCAAAAAGCT GCAGTTAATC 174 0 

ATAATTAACC GCAATATGCA TTAACCTATC TATAATACAA GTATATGATA AGTAATCAGC 1800 

AATCAGACAA TAGACAAAAG GGAAATATAA AAA 183 3 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Glu Tyr Trp Lys His Thr Asn His Gly Lys Asp Ala Gly Asn Glu 
15 10 15 



Leu Glu Thr Ser Met Ala Thr Asn Gly Asn Lys Leu Thr Asn Lys lie 
20 25 30 



Thr Tyr lie Leu Trp Thr lie lie Leu Val Leu Leu Ser lie Val Phe 
35 40 45 



# 



He He Val Leu He Asn Ser He Lys Ser Glu Lys Ala His Glu Ser 
50 55 60 

Leu Leu Gin Asp He Asn Asn Glu Phe Met Glu He Thr Glu Lys He 
65 70 75 80 

Gin Met Ala Ser Asp Asn Thr Asn Asp Leu He Gin Ser Gly Val Asn 
85 90 95 

Thr Arg Leu Leu Thr He Gin Ser His Val Gin Asn Tyr He Pro He 
100 105 110 

Ser Leu Thr Gin Gin Met Ser Asp Leu Arg Lys Phe He Ser Glu He 
115 120 125 

Thr He Arg Asn Asp Asn Gin Glu Val Leu Pro Gin Arg He Thr His 
130 135 140 

Asp Val Gly He Lys Pro Leu Asn Pro Asp Asp Phe Trp Arg Cys Thr 
145 150 155 160 

Ser Gly Leu Pro Ser Leu Met Lys Thr Pro Lys He Arg Leu Met Pro 
165 170 175 

Gly Pro Gly Leu Leu Ala Met Pro Thr Thr Val Asp Gly Cys He Arg 
180 185 190 

Thr Pro Ser Leu Val He Asn Asp Leu He Tyr Ala Tyr Thr Ser Asn 
195 200 205 

Leu He Thr Arg Gly Cys Gin Asp He Gly Lys Ser Tyr Gin Val Leu 
210 215 220 

Gin He Gly He He Thr Val Asn Ser Asp Leu Val Pro Asp Leu Asn 
225 230 235 240 

Pro Arg He Ser His Thr Phe Asn He Asn Asp Asn Arg Lys Ser Cys 
245 250 255 

Ser Leu Ala Leu Leu Asn Thr Asp Val Tyr Gin Leu Cys Ser Thr Pro 
260 265 270 

Lys Val Asp Glu Arg Ser Asp Tyr Ala Ser Ser Gly He Glu Asp He 
275 280 285 

Val Leu Asp He Val Asn Tyr Asp Gly Ser He Ser Thr Thr Arg Phe 
290 295 300 

Lys Asn Asn Asn He Ser Phe Asp Gin Pro Tyr Ala Ala Leu Tyr Pro 
305 310 315 320 

Ser Val Gly Pro Gly He Tyr Tyr Lys Gly Lys He He Phe Leu Gly 
325 330 335 



Tyr Gly Gly Leu Glu His Pro He Asn Glu Asn Val He Cys Asn Thr 
340 345 350 



Thr Gly Cys Pro Gly Lys Thr Gin Arg Asp Cys Asn Gin Ala Ser His 
355 360 365 

Ser Pro Trp Phe Ser Asp Arg Arg Met Val Asn Ser lie lie Val Val 
370 375 380 

Asp Lys Gly Leu Asn Ser lie Pro Lys Leu Lys Val Trp Thr lie Ser 
385 390 395 400 

Met Arg Gin Asn Tyr Trp Gly Ser Glu Gly Arg Leu Leu Leu Leu Gly 
405 410 415 

Asn Lys lie Tyr lie Tyr Thr Arg Ser Thr Ser Trp His Ser Lys Leu 
420 425 430 

Gin Leu Gly lie lie Asp lie Thr Asp Tyr Ser Asp lie Arg lie Lys 
435 440 445 

Trp Thr Trp His Asn Val Leu Ser Arg Pro Gly Asn Asn Glu Cys Pro 
450 455 460 

CI 

tfl Trp Gly His Ser Cys Pro Asp Gly Cys lie Thr Gly Val Tyr Thr Asp 

4= 465 470 475 480 

Ala Tyr Pro Leu Asn Pro Thr Gly Ser He Val Ser Ser Val He Leu 
f|| 485 490 495 

?J Asp Ser Gin Lys Ser Arg Val Asn Pro Val He Thr Tyr Ser Thr Ala 

~ 500 505 510 



Thr Glu Arg Val Asn Glu Leu Ala He Arg Asn Arg Thr Leu Ser Ala 
515 520 525 

Gly Tyr Thr Thr Thr Ser Cys He Thr His Tyr Asn Lys Gly Tyr Cys 
530 535 540 

Phe His He Val Glu He Asn Gin Lys Ser Leu Asn Thr Leu Gin Pro 
545 550 555 560 

Met Leu Phe Lys Thr Glu Val Pro Lys Ser Cys Ser 
565 570 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1886 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
ATGGAGTTGC CAATCCTCAA AGCAAATGCA ATTACCACAA TCCTCGCTGC AGTCACATTT 



60 




TGCTTTGCTT CTAGTCAAAA CATCACTGAA 
AGCAAAGGCT ATCTTAGTGC TCTAAGAACT 
TTAAGTAATA TCAAGGAAAA TAAGTGTAAT 
CAAGAATTAG ATAAATATAA AAATGCTGTA 
CCAGCAGCAA ACAATCGAGC CAGAAGAGAA 
AATACCAAAA AAACCAATGT AACATTAAGC 
TTGTTAGGTG TTGGATCTGC AATCGCCAGT 
GAAGGAGAAG TGAACAAGAT CAAAAGTGCT 
TTATCAAATG GAGTTAGTGT CTTAACCAGC 
AAACAATTGT TACCTATTGT GAATAAGCAA 
Q ATAGAGTTCC AACAAAAGAA CAACAGACTA 

4; GCAGGTGTAA CTACACCTGT AAGCACTTAC 

ATCAATGATA TGCCTATAAC AAATGATCAG 

Si 

s>» GTTAGAC AG C AAAGTTACTC TATCATGTCC 

sss= 

M GTACAATTAC CACTATATGG TGTGATAGAT 

2 

CS CTATGTACAA CCAACACAAA AGAAGGGTCA 

: . 

I™ <; 

□ TGGTACTGTG ACAATGCAGG ATCAGTATCT 

P CAATCGAATC GAGTATTTTG TGACACAATG 

~* CTCTGCAATG TTGACATATT CAATCCCAAA 

GATGTAAGCA GCTCCGTTAT CACATCTCTA 
AAATGTACAG CATC C AATAA AAATCGTGGA 
TATGTATCAA ATAAAGGGGT GGACACTGTG 
AAGCAAGAAG GCAAAAGTCT CTATGTAAAA 
TTAGTATTCC CCTCTGATGA ATTTGATGCA 
CAGAGTTTAG CATTTATTCG TAAATCCGAT 
TCAACCACAA ATATCATGAT AACTACTATA 
TTAATTGCTG TTGGACTGCT CCTATACTGT 
AAGGATCAAC TGAGTGGTAT AAATAATATT 




GAATTTTATC AATCAACATG CAGTGCAGTT 12 0 

GGTTGGTATA CTAGTGTTAT AACTATAGAA 18 0 

GGAACAGATG CTAAGGTAAA ATTGATGAAA 24 0 

ACAGAATTGC AGTTGCTCAT GCAAAGCACA 300 

CTACCAAGGT TTATGAATTA TACACTCAAC 360 

AAGAAAAGGA AAAGAAGATT TCTTGGTTTT 42 0 

GGCATTGCTG TATCTAAGGT CCTGCACTTA 4 80 

CTACTATCCA CAAACAAGGC CGTAGTCAGC 54 0 

AAAGTGTTAG ACCTCAAAAA CTATATAGAT 600 

AGCTGCAGAA TATCAAATAT AGAAACTGTG 660 

CTAGAGATTA CCAGGGAATT TAGTGTTAAT 72 0 

ATGTTAACTA ATAGTGAATT ATTGTCATTA 780 

AAAAAGTTAA TGTCCAACAA TGTTCAAATA 84 0 

ATAATAAAAG AGGAAGTCTT AGCATATGTA 900 

ACACCTTGTT GGAAATTACA CACATCCCCT 960 

AACATCTGTT TAACAAGAAC TGACAGAGGA 102 0 

TTCTTCCCAC AAGCTGAAAC ATGTAAAGTT 1080 

AACAGTTTAA CATTACCAAG TGAAGTAAAT 114 0 

TATGATTGTA AAATTATGAC TTCAAAAACA 12 00 

GGAGCCATTG TGTCATGCTA TGGCAAAACT 1260 

ATCATAAAGA CATTTTCTAA CGGGTGTGAT 132 0 

TCTGTAGGTA ACACATTATA TTATGTAAAT 13 80 

GGTGAACCAA TAATAAATTT CTATGACCCA 144 0 

TCAATATCTC AAGTCAATGA GAAGATTAAC 150 0 

GAATTATTAC ATAATGTAAA TGCTGGTAAA 1560 

ATTATAGTGA TTATAGTAAT ATTGTTATCA 162 0 

AAGGC CAGAA GCACACCAGT CACACTAAGC 1680 

GCATTTAGTA ACTGAATAAA AATAGCACCT 174 0 



AATCATGTTC TTACAATGGT TTACTATCTG CTCATAGACA ACCCATCTAT CATTGGATTT 
TCTTAAAATC TGAACTTCAT CGAAACTCTT AT CTATAAAC CATCTCACTT ACACTATTTA 
AGTAGATTCC TAGTTTATAG TTATAT 



1800 
1860 
1886 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 594 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Glu Leu Pro lie Leu Lys Ala Asn Ala lie Thr Thr lie Leu Ala 
15 10 15 

Ala Val Thr Phe Cys Phe Ala Ser Ser Gin Asn lie Thr Glu Glu Phe 
20 25 30 

Tyr Gin Ser Thr Cys Ser Ala Val Ser Lys Gly Tyr Leu Ser Ala Leu 
35 40 45 

Arg Thr Gly Trp Tyr Thr Ser Val lie Thr lie Glu Leu Ser Asn lie 
50 55 60 

Lys Glu Asn Lys Cys Asn Gly Thr Asp Ala Lys Val Lys Leu Met Lys 
65 70 75 80 

Gin Glu Leu Asp Lys Tyr Lys Asn Ala Val Thr Glu Leu Gin Leu Leu 
85 90 95 

Met Gin Ser Thr Pro Ala Ala Asn Asn Arg Ala Arg Arg Glu Leu Pro 
100 105 110 

Arg Phe Met Asn Tyr Thr Leu Asn Asn Thr Lys Lys Thr Asn Val Thr 
115 120 125 

Leu Ser Lys Lys Arg Lys Arg Arg Phe Leu Gly Phe Leu Leu Gly Val 
130 135 140 

Gly Ser Ala lie Ala Ser Gly lie Ala Val Ser Lys Val Leu His Leu 
145 150 155 160 

Glu Gly Glu Val Asn Lys lie Lys Ser Ala Leu Leu Ser Thr Asn Lys 
165 170 175 

Ala Val Val Ser Leu Ser Asn Gly Val Ser Val Leu Thr Ser Lys Val 
180 185 190 



Leu Asp Leu Lys Asn Tyr lie Asp Lys Gin Leu Leu Pro lie Val Asn 
195 200 205 



Lys Gin Ser Cys Arg lie Ser Asn lie Glu Thr Val lie Glu Phe Gin 
210 215 220 



His Lys Asn Asn Arg Leu Leu Glu lie Thr Arg Glu Phe Ser Val Asn 
225 230 235 240 

Ala Gly Val Thr Thr Pro Val Ser Thr Tyr Met Leu Thr Asn Ser Glu 
245 250 255 

Leu Leu Ser Leu lie Asn Asp Met Pro lie Thr Asn Asp Gin Lys Lys 
260 265 270 

Leu Met Ser Asn Asn Val Gin lie Val Arg Gin Gin Ser Tyr Ser lie 
275 280 285 

Met Ser lie lie Lys Glu Glu Val Leu Ala Tyr Val Val Gin Leu Pro 
290 295 300 

Leu Tyr Gly Val lie Asp Thr Pro Cys Trp Lys Leu His Thr Ser Pro 
305 310 315 320 

Leu Cys Thr Thr Asn Thr Lys Glu Gly Ser Asn lie Cys Leu Thr Arg 
325 330 335 

Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala Gly Ser Val Ser Phe Phe 
340 345 350 

Pro Gin Ala Glu Thr Cys Lys Val Gin Ser Asn Arg Val Phe Cys Asp 
355 360 365 

Thr Met Asn Ser Leu Thr Leu Pro Ser Glu Val Asn Leu Cys Asn Val 
370 375 380 

Asp lie Phe Asn Pro Lys Tyr Asp Cys Lys lie Met Thr Ser Lys Thr 
385 390 395 400 

Asp Val Ser Ser Ser Val lie Thr Ser Leu Gly Ala lie Val Ser Cys 
405 410 415 

Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn Arg Gly lie lie 
420 425 430 

Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn Lys Gly Val Asp 
435 440 445 

Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn Lys Gin Glu Gly 
450 455 460 

Lys Ser Leu Tyr Val Lys Gly Glu Pro lie lie Asn Phe Tyr Asp Pro 
465 470 475 480 

Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser lie Ser Gin Val Asn 
485 490 495 



Glu Lys lie Asn Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser lie 
500 505 510 



Ser Gin Val Asn Glu 
515 

Ser Asp Glu Leu Leu 
530 

He Met He Thr Thr 
545 

Leu He Ala Val Gly 
565 



Lys He Asn Gin Ser Leu 
520 

His Asn Val Asn Ala Gly 
535 

He He He Glu He He 
550 555 

Leu Leu Leu Tyr Cys Lys 
570 



Ala Phe He Arg Lys 
525 

Lys Ser Thr Thr Asn 
540 

Val He Leu Leu Ser 
560 

Ala Arg Ser Thr Pro 
575 



Val Thr Leu Ser Lys Asp Gin Leu Ser Gly He Asn Asn He Ala Phe 
580 585 590 



Ser Asn 



(2) INFORMATION FOR SEQ ID NO : 7 



pe 



(i) SEQUENCE CHARACTERISTICS: 
3" (A) LENGTH: 92 0 base pairs 

~ll (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
ft! (D) TOPOLOGY: linear 

sz 

6 

B 

□ (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

i 

TGCAAACATG TCCAAAAACA AGGACCAACG CACCGCTAAG ACACTAGAAA AGACCTGGGA 60 
2l CACTCTCAAT CATTTATTAT TCATATCATC GGGCTTATAT AAGTTAAATC TTAAATCTGT 120 

j "5 
issv 

CI AG CACAAATC ACATTATCCA TTCTGGCAAT GATAATCTCA ACTTCACTTA TAATTACAGC 180 

CATCATATTC ATAGCCTCGG CAAACCACAA AGTCACACTA ACAACTGCAA TCATACAAGA 24 0 

TGCAACAAGC CAGATCAAGA ACACAACCCC AACATACCTC ACTCAGGATC CTCAGCTTGG 300 

AATCAGCTTC TCCAATCTGT CTGAAATTAC ATCACAAACC ACCACCATAC TAGCTTCAAC 3 60 

AACACCAGGA GTCAAGTCAA ACCTGCAACC CACAACAGTC AAGACTAAAA ACACAACAAC 42 0 

AACCCAAACA CAACCCAGCA AGCCCACTAC AAAACAACGC CAAAACAAAC CACCAAACAA 4 80 

AC CCAATAAT GATTTTCACT TCGAAGTGTT TAACTTTGTA CCCTGCAGCA TATGCAGCAA 54 0 

CAATCCAACC TGCTGGGCTA TCTGCAAAAG AATACCAAAC AAAAAACCAG GAAAGAAAAC 600 

CACCACCAAG CCTACAAAAA AACCAACCTT CAAGACAACC AAAAAAGATC TCAAACCTCA 660 

AACCACTAAA CCAAAGGAAG TACCCACCAC CAAGCCCACA GAAGAGC CAA CCATCAACAC 72 0 



CACCAAAACA AACATCACAA CTACACTGCT CACCAACAAC ACCACAGGAA ATCCAAAACT 
CACAAGTCAA ATGGAAACCT TCCACTCAAC CTCCTCCGAA GGCAATCTAA GCCCTTCTCA 
AGTCTCCACA ACATCCGAGC ACCCATCACA ACCCTCATCT CCACCCAACA CAACACGCCA 
GTAGTTATTA AAAAAAAAAA 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



Met Ser Lys Asn Lys Asp Gin Arg 
1 5 

Trp Asp Thr Leu Asn His Leu Leu 
20 

Leu Asn Leu Lys Ser Val Ala Gin 
35 40 



Thr Ala Lys Thr Leu Glu Lys Thr 
10 15 

Phe lie Ser Ser Gly Leu Tyr Lys 
25 30 

lie Thr Leu Ser lie Leu Ala Met 
45 



He He Ser Thr 
50 

Ala Asn His Lys 
65 

Ser Gin He Lys 



Leu Gly He Ser 
100 

Thr He Leu Ala 
115 

Thr Thr Val Lys 
130 

Lys Pro Thr Thr 
145 

Asn Asp Phe His 



Ser Leu He He 
55 

Val Thr Leu Thr 
70 

Asn Thr Thr Pro 
85 

Phe Ser Asn Leu 



Ser Thr Thr Pro 
120 

Thr Lys Asn Thr 
135 

Lys Gin Arg Gin 
150 

Phe Glu Val Phe 
165 



Thr Ala He He 
60 

Thr Ala He He 
75 

Thr Tyr Leu Thr 
90 

Ser Glu He Thr 
105 

Gly Val Lys Ser 



Thr Thr Thr Gin 
140 

Asn Lys Pro Pro 
155 

Asn Phe Val Pro 
170 



Phe He Ala Ser 



Gin Asp Ala Thr 
80 

Gin Asp Pro Gin 
95 

Ser Gin Thr Thr 
110 

Asn Leu Gin Pro 
125 

Thr Gin Pro Ser 



Asn Lys Pro Asn 
160 

Cys Ser He Cys 
175 



Ser Asn Asn Pro Thr Cys Trp Ala He Cys Lys Arg He Pro Asn Lys 
180 185 190 



Lys Pro Gly Lys 
195 

Lys Thr Thr Lys 
210 

Val Pro Thr Thr 
225 

Thr Asn lie Thr 



Lys Leu Thr Ser 
260 

Asn Leu Ser Pro 
275 

Pro Ser Ser Pro 
290 



Lys Thr Thr Thr 
200 

Lys Asp Leu Lys 
215 

Lys Pro Thr Glu 
230 

Thr Thr Leu Leu 
245 

Gin Met Glu Thr 



Ser Gin Val Ser 
280 

Pro Asn Thr Thr 
295 



Lys Pro Thr Lys 



Pro Gin Thr Thr 
220 

Glu Pro Thr lie 
235 

Thr Asn Asn Thr 
250 

Phe His Ser Thr 
265 

Thr Thr Ser Glu 



Arg Gin 



Lys Pro Thr Phe 
205 

Lys Pro Lys Glu 



Asn Thr Thr Lys 
240 

Thr Gly Asn Pro 
255 

Ser Ser Glu Gly 
270 

His Pro Ser Gin 
285 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
ATCAATCAAA GGTCCTGTGA TAATAG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CATGACTTGA TAATGAG 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



□ 



(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
AATTCATGGA GTTGCTAATC CTCAAAGCAA ATGCAATTAC CACAATCCTC ACTGCAGTCA 60 
CATTTTGTTT TGCTTCTGGT TCTAAG 86 



(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
ACTGGCATCA ATCTAGCACT ACATGAG 2 7 



yj (2) INFORMATION FOR SEQ ID NO: 13: 

fU 

41 (i) SEQUENCE CHARACTERISTICS: 

p (A) LENGTH: 13 6 base pairs 

(B) TYPE: nucleic acid 

~t*% (C) STRANDEDNESS: single 

J 89 ? (D) TOPOLOGY: linear 



U (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



AATTCATGCC AACTTTAATA CTGCTAATTA TTACAACAAT GATTATGGCA TCTTCCTGCC 60 
AAATAGATAT CACAAAACTA CAGCATGTAG GTGTATTGGT CAACAGTCCC AAAGGGATGA 12 0 

AGATATCACA AAACTT 13 6 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 



ATCATGGAGA TAATTAAAAT GAT AAC CATC TCGCAAATAA ATAAGTATTT TACTGTTTTC 60 
GTAACAGTTT TGTAATAAAA AAACCTATAA ATAG 94 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 15 : 

ATCATGGAGA TAATTAAAAT GAT AAC CATC TCGCAAATAA ATAAGTATTT TACTGTTTTC 60 

GTAACAGTTT TGTAATAAAA AAACCTATAA ATATTCCGGA ATTCAGATCT GCAGCGGCCG 12 0 

CTCCATCTAG AAGGTACCCG G 141 

4» (2) INFORMATION FOR SEQ ID NO: 16: 

Uj (i) SEQUENCE CHARACTERISTICS: 
Rj (A) LENGTH: 31 base pairs 

11 (B) TYPE: nucleic acid 

p (C) STRANDEDNESS: single 

^ (D) TOPOLOGY: linear 

a 

h 

Cl (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Q CATGACTAAT TCCATCAAAA GTGAAAAGGC T 31 

□ 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CAAGAAAAAG GAATAAAA 18 



(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 



r 5 * 



(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ATTTCTGTGA TATTTGTGCG GTATAATGAT GCTATACCT 3 9 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

JpJ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

y] CAGGAGAAGG GTATCAAG 18 

FU (2) INFORMATION FOR SEQ ID NO: 20: 

Q (i) SEQUENCE CHARACTERISTICS: 

» (A) LENGTH: 17 base pairs 

p (B) TYPE: nucleic acid 

te? (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
AGGAGAAGGG TATCAAG 17 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
ATCATGGAGA TAATTAAAAT GATAACCATC TCGCAAATAA ATAAGTATTT TACTGTTTTC 



60 



GTAACAGTTT TGTAATAAAA AAACCTATAA ATAG 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

Gin Glu Lys Gly lie Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

Gin Glu Lys Gly lie Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
ATCAATCTAG CACTACACAG 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1617 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 




ATGCCAACTT TAATACTGCT AATTATTACA 
GATATCACAA AACTACAGCA TGTAGGTGTA 
TCACAAAACT TCGAAACAAG ATATCTAATT 
AACTCTTGTG GTGACCAACA GATCAAACAA 
CCTCTATATG ATGGATTAAG ATTACAGAAA 
GAAAACACTG ATCCCAGAAC AAGACGATCC 
GGAGTAGCAA CCTCAGCACA AATTACAGCG 
AAATCAGACA TCGAAAAACT CAAAGAAGCA 
GTTCAGAGCT CTATAGGAAA TTTAATAGTA 
AACGAAATGG TGCCATCGAT TGCTAGACTA 

□ ATTGCATTAA CACAGCATTA CTCAGAATTA 
jj= TTACAAGAAA AAGGAATAAA ATTACAAGGT 

GAAATATTCA CAACATCAAC AGTTGATAAA 
ttl TCAATAAAGG TGAGAGTTAT AGATGTTGAT 

□ AGACTCCCTT TATTAACTAG GCTGCTGAAC 
Q TATAATATCC AAAACAGAGA ATGGTATATC 
U GCATTTCTAG GTGGAGCAGA TGTCAAGGAA 
2: CCTTCTGATC CAGGATTTGT ACTAAACCAT 

□ TCCCAATGTC CAAGAACCAC GGTCACATCA 
GGAGGAGTGG TTGCAAACTG TATAACAACC 
ATCAATCAAC CACCTGATCA AGGAGTAAAA 
GGTATCAACG GAATGCTGTT CAATACAAAT 
AATGATATAA CACTAAATAA TTCTGTTGCA 
AACAAAGCCA AATCAGATCT AGAAGAATCA 
CTAGATTCTA TTGGAAACTG GCATCAATCT 
ATCATTATAT TGTTTATAAT TAATGTAACG 
ATTCAAAAGA GAAATCGAGT GGATCAAAAT 

(2) INFORMATION FOR SEQ ID NO: 26: 




ACAATGATTA TGGCATCTTC CTGCCAAATA 60 

TTGGTCAACA GTCCCAAAGG GATGAAGATA 12 0 

TTGAGCCTCA TACCAAAAAT AGAAGACTCT 180 

TACAAGAGGT TATTGGATAG ACTGATCATC 24 0 

GATGTGATAG TAACCAATCA AGAATCCAAT 3 00 

TTTGGAGGGG TAATTGGAAC CATTGCTCTG 3 60 

GCAGTTGCTC TGGTTGAAGC CAAGCAGGCA 42 0 

ATCAGGGACA CAAACAAAGC AGTGCAGTCA 480 

GCAATTAAAT CAGTCCAAGA TTATGTCAAC 54 0 

GGTTGTGAAG CAGCAGGACT TCAATTAGGA 600 

ACAAACATAT TTGGTGATAA CATAGGATCG 660 

ATAGCATCAT TATACCGCAC AAATATCACA 72 0 

TATGATATCT ATGATCTATT ATTTACAGAA 780 

TTGAATGATT ACTCAATCAC CCTCCAAGTC 84 0 

ACTCAGATCT ACAAAGTAGA TTCCATATCA 90 0 

CCTCTTCCCA GCCATATCAT GACGAAAGGG 960 

TGTATAGAAG CATTCAGCAG TTATATATGC 102 0 

GAAATGGAGA GCTGCTTATC AGGAAACATA 108 0 

GACATTGTTC CAAGATATGC ATTTGTCAAT 114 0 

ACCTGTACAT GCAACGGAAT CGACAATAGA 12 00 

ATTATAACAC ATAAAGAATG TAATACAATA 12 60 

AAAGAAGGAA CTCTTGCATT CTACACACCA 132 0 

CTTGATCCAA TTGACATATC AATCGAGCTT 13 80 

AAAGAATGGA TAAGAAGGTC AAATCAAAAA 144 0 

AGCACTACAA TCATAATTAT TTTAATAATG 15 0 0 

ATAATTACAA TTGCAATTAA GTATTACAGA 1560 

G AC AAGC CAT ATGTACTAAC AAACAAA 1617 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SI 
ATGGAATACT GGAAGCATAC CAATCACGGA 
ATGGCTACTA ATGGCAACAA GCTCACCAAT 
CTGGTGTTAT TATCAATAGT CTTCATCATA 
GCTCATGAAT CATTGCTGCA AGACATAAAT 
CAAATGGCAT CGGATAATAC CAATGATCTA 
ACAATTCAGA GTCATGTCCA GAATTATATA 
£1 CTTAGGAAAT TCATTAGTGA AATTACAATT 

% l AGAATAACAC ATGATGTGGG TATAAAACCT 

yj 

Hi TCTGGTCTTC CATCTTTAAT GAAAACTCCA 

4= 

Q TTAGCTATGC CAACGACTGT TGATGGCTGT 

j*| CTGATTTATG CTTATACCTC AAATCTAATT 

l! TATCAAGTCT TACAGATAGG GATAATAACT 

Cl 

CCCAGGATCT CTCATACTTT TAACATAAAT 
p CTAAATACAG ATGTATATCA ACTGTGTTCA 

GCATCATCAG GCATAGAAGA TATTGTACTT 
ACAACAAGAT TTAAGAATAA TAACATAAGC 
TCTGTTGGAC CAGGGATATA CTACAAAGGC 
GAACATCCAA TAAATGAGAA TGTAATCTGC 
AGAGACTGCA ATCAGGCATC TCATAGTCCA 
ATCATTGTTG TTGACAAAGG CTTAAACTCA 
ATGAGACAGA ATTACTGGGG GTCAGAAGGA 
ATATATACAA GATCCACAAG TTGGCATAGC 
GATTACAGTG ATATAAGGAT AAAATGGACA 



!Q ID NO: 26: 

AAGGATGCTG GCAATGAGCT GGAGACGTCC 60 

AAGATAACAT ATATATTATG GACAATAATC 12 0 

GTGCTAATTA ATTCCATCAA AAGTGAAAAG 180 

AATGAGTTTA TGGAAATTAC AGAAAAGATC 240 

ATACAGTCAG GAGTGAATAC AAGGCTTCTT 3 00 

CCAATATCAC TGACACAACA GATGTCAGAT 3 60 

AGAAATGATA ATCAAGAAGT GCTGCCACAA 42 0 

TTAAATCCAG ATGATTTTTG GAGATGCACG 4 80 

AAAATAAGGT TAATGCCAGG GCCGGGATTA 540 

ATCAGAACTC CGTCCTTAGT TATAAATGAT 600 

ACTCGAGGTT GTCAGGATAT AGGAAAATCA 660 

GTAAACTCAG ACTTGGTACC TGACTTAAAT 72 0 

GACAATAGGA AGTCATGTTC TCTAGCACTC 78 0 

ACTCCCAAAG TTGATGAAAG ATCAGATTAT 84 0 

GATATTGTCA ATTATGATGG CTCAATCTCA 900 

TTTGATCAAC CTTATGCTGC ACTATACCCA 960 

AAAATAATAT TTCTCGGGTA TGGAGGTCTT 102 0 

AACACAACTG GGTGTCCCGG GAAAACACAG 10 8 0 

TGGTTTTCAG ATAGGAGGAT GGTCAACTCT 1140 

ATTCCAAAAT TGAAGGTATG GACGATATCT 12 0 0 

AGGTTACTTC TACTAGGTAA CAAGATCTAT 12 60 

AAGTTACAAT TAGGAATAAT TGATATTACT 132 0 

TGGCATAATG TGCTATCAAG ACCAGGAAAC 13 80 



AATGAATGTC CATGGGGACA TTCATGTCCA GATGGATGTA TAACAGGAGT ATATACTGAT 
GCATATCCAC TCAATCCCAC AGGGAGCATT GTGTCATCTG TCATATTAGA TTCACAAAAA 
TCGAGAGTGA ACCCAGTCAT AACTTACTCA ACAGCAACCG AAAGAGTAAA CGAGCTGGCC 
ATCCGAAACA GAACACTCTC AGCTGGATAT ACAACAACAA GCTGCATCAC ACACTATAAC 
AAAGGATATT GTTTTCATAT AGTAGAAATA AATCAGAAAA GCTTAAACAC ACTTCAACCC 
ATGTTGTTCA AGACAGAGGT TCCAAAAAGC TGCAG 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
ATGGAGTTGC CAATCCTCAA AGCAAATGCA ATTACCACAA TCCTCGCTGC AGTCACATTT 
TGCTTTGCTT CTAGTCAAAA CATCACTGAA GAATTTTATC AATCAACATG CAGTGCAGTT 
AGCAAAGGCT ATCTTAGTGC TCTAAGAACT GGTTGGTATA CTAGTGTTAT AACTATAGAA 
TTAAGTAATA TCAAGGAAAA TAAGTGTAAT GGAACAGATG CTAAGGTAAA ATTGATGAAA 
CAAGAATTAG ATAAATATAA AAATGCTGTA ACAGAATTGC AGTTGCTCAT GCAAAGCACA 
CCAGCAGCAA ACAATCGAGC CAGAAGAGAA CTACCAAGGT TTATGAATTA TACACTCAAC 
AATACCAAAA AAACCAATGT AACATTAAGC AAGAAAAGGA AAAGAAGATT TCTTGGTTTT 
TTGTTAGGTG TTGGATCTGC AATCGCCAGT GGCATTGCTG TATCTAAGGT CCTGCACTTA 
GAAGGAGAAG TGAACAAGAT CAAAAGTGCT CTACTATCCA CAAACAAGGC CGTAGTCAGC 
TTATCAAATG GAGTTAGTGT CTTAACCAGC AAAGTGTTAG ACCTCAAAAA CTATATAGAT 
AAACAATTGT TACCTATTGT GAATAAGCAA AGCTGCAGAA TATCAAATAT AGAAACTGTG 
ATAGAGTTCC AACAAAAGAA CAACAGACTA CTAGAGATTA CCAGGGAATT TAGTGTTAAT 
GCAGGTGTAA CTACACCTGT AAGCACTTAC ATGTTAACTA ATAGTGAATT ATTGTCATTA 
ATCAATGATA TGCCTATAAC AAATGATCAG AAAAAGTTAA TGTCCAACAA TGTTCAAATA 
GTTAGACAGC AAAGTTACTC TATCATGTCC ATAATAAAAG AGGAAGTCTT AGCATATGTA 
GTACAATTAC CACTATATGG TGTGATAGAT ACACCTTGTT GGAAATTACA CACATCCCCT 



CTATGTACAA CCAACACAAA AGAAGGGTCA AACATCTGTT TAACAAGAAC TGACAGAGGA 102 0 

TGGTACTGTG ACAATGCAGG ATCAGTATCT TTCTTCCCAC AAGCTGAAAC ATGTAAAGTT 1080 

CAATCGAATC GAGTATTTTG TGACACAATG AACAGTTTAA CATTACCAAG TGAAGTAAAT 1140 

CTCTGCAATG TTGACATATT CAATCCCAAA TATGATTGTA AAATTATGAC TTCAAAAACA 12 00 

GATGTAAGCA GCTCCGTTAT CACATCTCTA GGAGCCATTG TGTCATGCTA TGGCAAAACT 1260 

AAATGTACAG CATCCAATAA AAATCGTGGA ATCATAAAGA CATTTTCTAA CGGGTGTGAT 132 0 

TATGTATCAA ATAAAGGGGT GGACACTGTG TCTGTAGGTA ACACATTATA TTATGTAAAT 13 80 

AAGCAAGAAG GCAAAAGTCT CTATGTAAAA GGTGAACCAA TAATAAATTT CTATGACCCA 144 0 

TTAGTATTCC CCTCTGATGA ATTTGATGCA TCAATATCTC AAGTCAATGA GAAGATTAAC 15 00 

CAGAGTTTAG CATTTATTCG TAAATCCGAT GAATTATTAC ATAATGTAAA TGCTGGTAAA 15 60 

TCAACCACAA ATATCATGAT AACTACTATA ATTATAGTGA TTATAGTAAT ATTGTTATCA 162 0 

% TTAATTGCTG TTGGACTGCT CCTATACTGT AAGGCCAGAA GCACACCAGT CACACTAAGC 168 0 

|js AAGGATCAAC TGAGTGGTAT AAATAATATT GCATTTAGTA AC 1722 

"•I 

. 

Rl (2) INFORMATION FOR SEQ ID NO: 28: 

CI (i) SEQUENCE CHARACTERISTICS: 

s (A) LENGTH: 894 base pairs 

q (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
P% (D) TOPOLOGY: linear 

fcs# 

ess, 
i 5 

□ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

ATGTCCAAAA ACAAGGACCA ACGCACCGCT AAG AC AC TAG AAAAGAC CTG GGACACTCTC 60 

AATCATTTAT TATTCATATC ATCGGGCTTA TATAAGTTAA ATCTTAAATC TGTAGCACAA 12 0 

ATCACATTAT CCATTCTGGC AATGATAATC TCAACTTCAC TTATAATTAC AGCCATCATA 18 0 

TTCATAGCCT CGGCAAACCA CAAAGTCACA CTAACAACTG CAATCATACA AGATGCAACA 24 0 

AGCCAGATCA AGAACACAAC CCCAACATAC CTCACTCAGG ATCCTCAGCT TGGAATCAGC 3 00 

TTCTCCAATC TGTCTGAAAT TACATCACAA ACCACCACCA TACTAGCTTC AACAACACCA 3 60 

GGAGTCAAGT CAAACCTGCA ACCCACAACA GTCAAGACTA AAAACACAAC AACAACCCAA 42 0 

ACACAACCCA GCAAGCCCAC TACAAAACAA CGCCAAAACA AACCACCAAA CAAAC CCAAT 4 80 

AATGATTTTC ACTTCGAAGT GTTTAACTTT GTACCCTGCA GCATATGCAG CAACAATCCA 54 0 



ACCTGCTGGG CTATCTGCAA AAGAATACCA AACAAAAAAC CAGGAAAGAA AACCACCACC 600 

AAGCCTACAA AAAAACCAAC CTTCAAGACA ACCAAAAAAG ATCTCAAACC TCAAACCACT 660 

AAACCAAAGG AAGTACCCAC CACCAAGCCC ACAGAAGAGC CAACCATCAA CACCACCAAA 72 0 

ACAAACATCA CAACTACACT GCTCACCAAC AACACCACAG G AAAT C C AAA ACTCACAAGT 780 

CAAATGGAAA CCTTCCACTC AACCTCCTCC GAAGGCAATC TAAGCCCTTC TCAAGTCTCC 84 0 

ACAACATCCG AGCACCCATC ACAACCCTCA TCTCCACCCA ACACAACACG CCAG 894 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CGTAGTTAGT TTCCAGGACA CTATTATCCT AG 32 



J; (2) INFORMATION FOR SEQ ID NO: 30: 

CI 

l_ (i) SEQUENCE CHARACTERISTICS: 
C) (A) LENGTH: 17 base pairs 

f=* (B) TYPE: nucleic acid 

p (C) STRANDEDNESS : single 

v,j (D ) TOPOLOGY : 1 inear 

□ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TGAACTATTA CTCCTAG 17 



CI 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GTACCTCAAC GATTAGGAGT TTCGTTTACG TTAATGGTGT TAGGAGTGAC GTCAGTGTAA 



60 




(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 



fc. S 



□ 



t • 



(A) LENGTH: 145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

TAGTACCTCT ATTAATTTTA CTATTGGTAG AGCGTTTATT TATTCATAAA ATGACAAAAG 60 

CATTGTCAAA ACATTATTTT TTTGGATATT TATAAGGCCT TAAGTCTAGA CGTCGCCGGC 12 0 

GAGGTAGATC TTCCATGGGC CCTAG 145 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
TGATTAAGGT AGTTTTCACT TTTCCGAGTA C 31 

(2) INFORMATION FOR SEQ ID NO: 37: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
Cl (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
TAAAGACACT ATAAACACGC CATATTACTA CGATATGGA 3 9 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



♦ 



AGGACAAAAG 10 
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H 
vD 

m 
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